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DIRECT HAPLOTYPING USING CARBON NANOTUBE PROBES 

Background of the Invention 

The Human Genome Project is now providing massive amoxmts of genetic 
5 information that should revolutionize both the understanding and diagnosis of inherited 
diseases. In particular, the cataloging of single nucleotide polymorphisms (SNPs) in gene 
coding and regulatory regions should lead to a greater comprehension of the genetic 
contribution to risk for common diseases such as cancer and heart disease. To achieve 
maximum power, the haplotype of a subject — ^the specific alleles associated with each 

10 chromosome homologue — ^is a critical element in SNP mapping. However, the cunent 
methods for determining haplotypes have significant limitations that have prevented their 
use in large-scale genetic screening. For example, parental genotyping can be used to infer 
haplotypes in a family study, although in many cases it is impractical or impossible to 
obtain parental DNA. Furthermore, molecular techniques for determining haplotypes, such 

15 as allele specific or single molecule PCR amplification, are hampered by the need to 
optimize stringent reaction conditions and the potential for significant error rates. 

Current sequencing methods may lead to expanded SNPs databases, and it has been 
proposed that such databases could be used in whole-genome association and linkage 
studies to determine the many interacting genetic eflfects fliat determine susceptibility to 

20 diseases such as cancer. Recent studies suggest, however, fliat the exploitation of SNPs for 
determining disease risk and pharmacogenomics will not be so easy. There are several 
reasons for emerging diflSculties, including that most of the SNPs identified are likely 
unrelated to gene function (coding and/or regulatory regions). Without complete knowledge 
of all SNPs, which is a daunting task when one considers the costs of sequencing control 

25 and patient genomes from different population groups, direct association may not yield 
robust information about disease risk. There are additional considerations as well. For 
example, an individual heterozygous at n-sites (SNPs), could have 2n possible haplotypes, 
and each of these haplotypes may exhibit subtle variations in phenotype. 

These latter observations suggest that the haplotype of a subject — ^the specific 

30 alleles associated with each chromosome homologue — ^is a critical element in SNP 
mapping. However, the current methods for determining haplotypes have significant 
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limitations that have prevented their use in large-scale genetic screening. For example, 
parental genotyping can be used to infer haplotypes in a family study, although in many 
cases it is impractical or impossible to obtain parental DNA. Furthermore, molecular 
techniques for determining haplotypes, such as allele specific or single molecule PGR 
S amplification are hampered by the need to optimize stringent reaction conditions and the 
potential for significant eiror rates. These latter methods are also difficult to apply to repeat 
sequences, which can be an important component of regulatory regions of genes. 

Current methods of SNP mapping based on polymerase chain reactions (PGR) and 
fluorophore-labeled detection involve multiple biomedical reaction steps which result in a 

10 complicated and costly detection process, i.e., a few dollars per base, and these restrictions 
hinder their use for identifying polymorphisms in long DNA. Furthermore, these methods 
are not conducive for direct genomic DNA (gDNA) analysis. Moreover, this conventional 
method of SNP detection typically provides information about genotype, but genotyping 
alone does not contribute enough specifics to unambiguously determine tiie functional 

1 5 abnormalities of a gene. Haplotype determination is a decisive measurement in SNP 

mapping since haplotyping of a subject— differentiating the specific alleles associated with 
each chromosome homologue — provides the most incisive and complete information for 
understanding genetic contributions. Gonventional methods for determining haplotypes 
require acquiring complete parental genetic information in a pedigree which is extremely 

20 difficult to obtain in practice. This requirement intensifies difficulties experienced by the 
traditional haplotyping methods flirough introducing additional limitations to the pre- 
existing complications due to the need for allele-specific or single molecule PGR 
amplifications. Haplotyping is conventionally achieved by deduction since circumstances 
are frequently left with insufficient genetic resources from a pedigree and, thereby, this 

25 inferring method is prone to errors. Straightforward, PCR-fi:ee haplotyping is highly 
warranted in large-scale genetic screening for population studies but not practical at 
present 

Hence, there appears to be a clear need for new technology that can determine 
directiy sample haplotype with a high throughput. 


30 
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Summarv of the Invention 

The present invention is directed to haplotyping by direct visualization of 
polymorphic sites on individual biosequences. According to one aspect of &e invention, a 
method is disclosed for detecting single nucleotide polymoiphisms of gene samples using 
5 an atomic force microscope (AFM). In one embodiment, the method of detecting single 
nucleotide polymorphisms of gene samples comprises the steps of a) providing the AFM 
wifli at least one nanotube tip, b) moving said nanotube tip across the gene sample, and c) 
recording an image obtained with said AFM. 

In an embodiment, the gene sample comprises DNA. In an embodiment, the gene 

1 0 sample comprises a DNA fragment. . In an embodiment, the gene sample comprises an 
amplified DNA fragment using polymerase chain reaction. In another embodiment, the 
gene sample is genondc DNA (gDNA). The method can be used to read directly multiple 
polymorphic sites in DNA fragments containing from about 100 to at least 10,000 bases, at 
least 1 3,000 bases, or even at least 20,000 bases 

15 In an embodiment, the biosequence or gene sample may comprise a probe to detect 

a single base mismatch e.g. single nucleotides polymorphorisms, or more flian one single 
base mismatches in a gene. In certain embodiments, the probe corresponds to a molecule, 
biosequence, or a genetic or peptide process which creates a feature which is detectable by 
atomic force microscopy. In certain further embodiments, the probe is a peptide nucleic 

20 acid (PNA). In certain other embodiments, the probe is an oligonucleotide. 

In an embodiment, the probe comprises a label or detectable moiety. In a further 
embodiment, an oligonucleotide probe is labeled with a molecule which can be detected by 
atomic force microscopy. In a further embodiment, the oligonucleotide probe is labeled 
wifli streptavidin. In an embodiment, the gene is gDNA. In a further embodiment the gDNA 

25 is labeled with a peptide nucleic acid probe. In an even further embodiment, the gDNA is 
labeled with a peptide nucleic acid probe:with SEQ ID 1 : 5' CTTTATGCCACAGAGCTA 
3*. In yet another embodiment, the gene sample contains more than one label or probe. 

• In another embodunent, the gene has more than one single nucleotide 
polymorphism. In a further embodiment, the single nucleotide polymorphisms further 

30 comprise probes. In an even further embodiment, the probes comprise. different peptide 
nucleic acids. 
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In an embodiment, the throughput of this method can be extended, for example, by 
employing arrays of multiple SWNT tips. 

In another embodiment, the nanotip comprises a single walled nanotube (SWNT). In 
a further embodiment, the nanotube tip comprises multiple SWNTs. In another further 
5 embodiment, the nanotube tip comprises an individual SWNT. 

According to another aspect of the invention, an apparatus for detecting single 
nucleotide polymorphisms of gene is disclosed which includes an atomic force microscope 
(AFM) with a seaming probe having at least one nanotube tip, scanning means for 
scanning said nanotube tip across a sample, and detection means for detecting with said 
10 nanotube tip a characteristic feature of at least a portion of the sample. The characteristic 
feature corresponds to a height and/or a spatial extent of said at least one sample portion, 
wherein said sample is a gene sample. In a further embodiment, the gene sample further 
comprises a probe. 

The tips of the AFM may employ high-resolution single-walled carbon nanotube 
1 5 (SWNT) probes, which can be reproducibly prepared with tip radii of less than 3 nm and 
about 2 base resolution, about 5 base resolution, or about 10 base resolution to enable high 
resolution, multiplex detection of different labels. 

In one embodiment, specific hybridization of labeled ohgonucleotide probes may be 
used to target sequences in DNA fragments, using PGR, followed by direct reading of the 
20 presence and spatial locations of flie labels by AFM. In this embodiment, the 

oligonucleotide probes are designed such that under appropriate hybridization conditions, 
binding does not occur in the presence of a single-base mismatch at polymorphic sites; i.e., 
labels are detected only at sequences fully complementary to the oligonucleotides. The 
oligonucleotides can be labeled with differing size molecules or compounds, which can be 
25 consistently distmguished from one another on the basis of size. 

In another embodiment, protein nucleic acids (PNA) are used as probes. In a further 
embodiment, the protein nucleic acids interact with double stranded DNA (dsDNA) for 
haplotyping which can be detected by visualization of the labeled DNA using AFM with a 
SWNT tip. 

30 In another embodiment, the gene is gDNA. In a further embodiment, the gDNA is 

comprises a PNA probe.. In yet another embodiment, gDNA comprises a PNA and is 
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separately labeled. In a further embodiment, gDNA comprises more than one PNA which 
have differing labels. In an even further embodiment, gDNA is labeled with both biotin- 
and fluorescein-terrainated PNAs. In an even further embodiment, gDNA comprises a 
biotinylated PNA is reacted is labeled with streptavidin- magnetic beads. In a further 
5 embodiment, the SWNT tip comprises an individual SWNT. 

In another embodiment, the apparatus further comprises a micron scale channel 
which elongates biosequences such as DNA along a well-defined direction. In a further 
embodiment, the micron scale channel comprises polydimetihylsiloxane (PDMS). 
Brief Description of the Drawings 
10 Figure lA shows schematic diagram of an atonic force microscope and image 

display. 

Figure IB shows schematic diagram of an curved silicon tip and a cross sectional 

view. 

Figure 2A depicts a fragment of a stick model of a single-walled carbon nanotubes. 
15 Figure 2B depicts a transmission electron micrograph of a cross-sectional structure 

of multi-walledcarbon nanotube. 

Figure 2C depicts a transmission electron micrograph of a cross-sectional structure 
of single-walled carbon nanotube. 

Figure 3 A shows schematically a Fe-catalyzed CVD growth process of nanotubes 
20 using ethylene. 

Figure 3B shows a CVD apparatus for controlled growth of nanotubes. 

Figure 4 shows a schematic illustration of the surface growth process used to 
prepare SWNT tips. 

Figures 5 A and B depict scanning electron microscope images of the CVD nanotube 
25 tip before (A) and after (B) electrical etching. 

Figures 5C shows a TEM image of the end of a shortened nanotube; 
Figure 6 A illustrates schematically one method of detection of labeled DNA sites 
with nanotube tips: labeled oligonucleotide probes are specifically annealed to their 
complementary target sequences (a and b) but not to sequences with a single base mismatch 
30 (A and B) in the ssDNA template. 
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Figure 6B illustrates one method of depositing labeled DNA molecules on freshly 
cleaved mica and imaging by AFM using SWNT probes. 

Figure 7A shows SWNT tip AFM image and height profile along DNA, obtained 
with streptavidin-labeled GGGCGCG in M13mpl8 digested with BgllL; the arrow points to 
5 the streptavidin tag; the image height scale is 3 nm and the white bar corresponds to 1 00 
nm. 

Figure 7B shows a histogram of number of streptavidin tags as a function of 
distance from the BglH restriction site, obtained from height plots along M13mpl8 labeled 
at GGGCGCG. 

1 0 Figure 7C shows a histogram of nxunber of streptavidin tags as a function of 

distance from the BglH restriction site, obtained from height plots along M13mpl8 labeled 
at GGGCGCG, where the sample was digested with BamBI and distances were measured 
from this site. 

Figure 7D shows possible positions of the target sequence, based on the calculated 
15 distance from the restriction sites; the labeled sequence occurs where two arrowheads meet 
(one from each digest); solid arcs indicate the correct paths, while incorrect paths are shown 
as dashed arcs with an "x" through them. 

Figure 8 A illustrates schematically direct haplotyping of UGTl A7 using SWNTs 
and shows haplotypes, alleles, genotypes, and locations of probes in samples analyzed, 
20 Figure 8B depicts representative SWNT tip AFM height image of the *3 allele 

(streptavidin end-labeled, ^140 nm DNA) detected m a sample that was heterozygous at 
both loci. 

Figure 8C depicts SWNT probe image of the *1 allele (IRD800 end-labeled, -210 
nm DNA) detected in tiie same sample as 8B; the height scale is 3 nm and the white bar 
25 corresponds to 50 nm in both images. 

Figure 8D depicts a histogram showing number of streptavidin (blue) and IRD800 
(red) end-labeled fragments vs. DNA length for a sample known to have either the (* l/*3) 
or (*2/*4) haplotype. 

Figure 8E depicts a histogram same as in 8D, but from a different individual; both 
30 samples were determined to have the (* l/*3) haplotype. 


wo 02/022889 PCT/USO 1/42 138 

-7- 

Figure 9A depicts a schematic of an approach to SNP and haplotype 
characterization by directly imaging peptide nucleic acid (PNA)-labelled and elongated 
DNA fragments where bold vertical lines represent each chromosome homolog and the 
letters specify the two polymorphic sites. 
5 Figure 9B depicts schematic representations of PNA labels targeting 129K13 IK and 

208R sites in the *l/*3 and *2/*4 haplotypes of UGT1A7 gene. 

Figure lOA depicts AFM images of a PNA labelled UGT1A7 gene. 

Figure lOB depicts AFM images of doubly-labelled UGT1A7 gene; the PNA labels 
are visible in the left image and the right image shows further conjugation with streptavidin 
1 0 with the maximum height difference between PNAs and dsDNA is about 0.4 ran. Inserts in 
the AFM images are the height profiles taken along the lines defined by flie white arrows. 
A sample shown in the top histogram illustrates two clusters of PNA labels which are 
separated by the average distance of 81 .3 nm (~ 239 base pairs). 

Figure IOC depicts a histogram representing the PNA label distribution on flie gene 
1 5 taken from another individual, showmg the presence of both labels at their expected 
locations, and displaying the characteristics of the *l/*3 haplotype. 

Figure 1 1 A depicts AFM images of ml 3mpl 8 reacted wifli PNAs (guided with 
white arrows in the left image) and additional streptavidin labels (right image) by targeting 
5*-ACGCGCC-3' and 5'-TCTCAGCC-3' sites; the two labels are identified at the average 
20 distance of 0,28 m and 1 .42 m from the restriction site which corresponds to the 

estimated positions of the 5'-ACGCGCC-3' (0.26 m) and 5'-TCTCAGCC-3' (L44 m) 
sites from the restriction site. 

Figure 1 IB depicts AFM images of ml3mpl8 reacted with streptavidin to the 
biotinylated ends of each PNA probe, 5'-TCTCAGCC-3' (A) and 5'-ACGCGCC-3'(B), 
25 showing sequence-specificity of PNAs.; each label is identified at their expected locations 
of 0.27 m and 1 .41 m from the restriction site as shown in the combined histogram plot. 

Figure 12A depicts a schematic representation of a PDMS microfluidic channel 

Figure 12B depicts an AFM image of elongated ml3mpl8 on a silicon substrate 
using the controlled flow within a PDMS channel of 100 m in width. 
30 Figure 12C depicts an AFM image of aligned lambda DNA on a silicon substrate 

using the controlled flow within a PDMS channel of 1 00 m in width. 
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Figure 12D depicts elongated, singly-labeled ml3mpl8 by conjugating streptavidin 
to the biotinylated PNA labels. 

Figure 13 depicts automated haplotype determination using multiple SWNT tip 

arrays. 

5 Figure 14 schematically depicts haplotyping with genomic DNA. 

Figure 15 schematically depicts a cystic fibrosis transmembrane conductance 
regulator (CFTR) mapping using a PNA reaction of gDNA, and labeled with streptavidin 
coated paramagnetic particles, and captured by a magnet 

Figure 16 depicts the PNA labels for SNP/haplotype mapping of the cystic fibrosis 

10 gene. 

Figure 17 shows AFM images of a SNP site labeled with streptavidin magnetic 
beads-biotm-PNA (S'CTTTATGCCACAGAAGCTAB') and PNA (S'GTCCGCTAGS') 
Figure 18 shows an AFM image of the Delta F508 site in a CFTR gene separated from 
gDNA 

15 Figure 19 shows examples of multiplexing in CFTR. 

Detailed Description 

The present invention is directed to haplotyping by direct visualization of 
polymorphic sites on individual genes. According to one aspect of the invention, a method 
is disclosed for detecting single nucleotide polymorphisms of gene santiples using an atomic 

20 force microscope (AFM). In one embodiment, the method of detecting single nucleotide 
polymorphisms of gene samples comprises the steps of a) providing the AFM with at least 
on nanotube tip, b) moving said nanotube tip across the gene sample, and c) recording an 
image obtained with said AFM. In an embodiment, the nanotube tips are comprised of 
single-walled nanotubes (SWNTs). In an embodiment, metal-catalyzed chemical vapor 

25 deposition (CVD) is used to achieve reproducibiUty in the growth characteristics of the 
nanotube tips. 

In an embodiment, the gene comprises a probe. A probe may include a nucleotide 
sequence such as an oligonucleotide which targets a specific sequence in a gene, for 
example in DNA or in a DNA fragment, A probe may also include a peptide nucleic acid. 
30 In one embodiment, two or more probes are used. For example, two probes may be 

designed to hybridize to a target sequence thereby generating a detectable signal whereby 
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the probing nucleobase sequence of each probe comprises the complement to about half of 
the complete target sequence. 

In a further embodiment, the probe comprises a label or detectable moiety which 
can be attached to an probe to render the probe detectable by AFM. Such labels may 
5 include derivatized avidin ,streptavidin, large monodisperse labelling protems, a dextran 
conjugate, a branched nucleic acid detection system, a chromophore, a fluorophore, for 
example, 5(6)-caiboxyfluorescein (Flu), 6-((7-amino-4-methylcoumarin-3- 
acetyl)amino)hexanoic acid (Cou), 5(and 6>carboxy-X-rhodamine (Rox), Cyanine 2 (Cy2) 
Dye, Cyanine 3 (Cy3) Dye, Cyanine 3.5 (Cy3.5) Dye, Cyanine 5 (Cy5) Dye, Cyanine 5.5 

10 (Cy5.5) Dye Cyanine 7 (Cy7) Dye, Cyanine 9 (Cy9) Dye, a spin label, a radioisotope, an 
enzyme, a hapten, for example, 5(6)-carboxyfluorescein, 2,4-dinitrophenyl, digoxigenm, 
and biotin, an acridinium ester, chemiluminescent compounds, and magnetic labels. Other 
labels mclude heteroduplexes. In another embodiment, the probe may be hybridized to a 
nucleic acid target to cause a detectable change in at least one physical property of at least 

1 5 one attached label in a manner which can be used to detect, or identify, the presence of a 
single nucleotide polymorphism. 

In a furflier embodiment, spacer or linker moieties may be used for example, to 
minimize the adverse effects that bulky labeling reagents might have on hybridization 
properties of probes. Linkers may induce flexibility and randonmess into the probe or 

20 otherwise link two or more nucleobase sequences of a probe or component 

polymer.Examples of spacer/linker moieties which may be used for the nucleobase 
polymers of this invention consist of one or more aminoalkyl carboxylic acids (e.g. 
aminocaproic acid) the side chain of an amino add (e.g. the side chain of lysine or 
ornithine) natural amino acids (e.g. glycine), aminooxyalkylacids (e.g. 8-amino-3,6- 

25 dioxaoctanoic acid), alkyl diacids (e.g. succinic acid), alkyloxy diacids (e.g. diglycolic acid) 
or alkyldiamines (e.g. l,8-diamino-3,6-dioxaoctane). Spacer/linker moieties may also 
incidentally or intentionally be constructed to improve flie water solubility of the probe 

The probes need not be labeled with a detectable moiety. When using probes, it is 
possible to detect the probe/target sequence complex formed, for example, by hybridization 

30 of the probing nucleobase sequence of flie probe to the target sequence. 
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One or more of the probes may be immobilized to a surface for the detection of 
SNPs. In one embodiment, two or more probes may be immobilized each at a specified 
position. Because the location and composition of each immobilized probe is known, 
arrays are generally useful for the simultaneous detection, or identification, of SNPs of two 
5 or more gene sequences. Arrays of probes may be regenerated by stripping the hybridized 
nucleic acid after each assay, thereby providing a means to repetitively analyze numerous 
samples using die same array. 
Definitions 

The term **biosequence" refers to a gene, genetic or protein sequence, peptide 
10 nucleic acid, or DNA or DNA fragment 

The term "genomic DNA" or "gDNA" generally refers to the entire length of DNA, 
including non coding regions. 

As used herein, the terms "label" and "detectable moiety" shall be interchangeable 
and shall refer to moietes which can be attached to a nucleotide, oligonucleotide or PNA 
1 5 probe to render the probe detectable by AFM. 

As used herein, the term "nucleobase" shall include those naturally occurring and 
those non-naturaUy occurring heterocyclic moieties commonly known to those who utilize 
nucleic acid technology or utilize peptide nucleic acid technology to thereby generate 
polymers which can sequence specifically bind to nucleic acids, 
20 The term "nucleobase sequence" shall include any segment of a polymer which 

comprises nucleobase containing subunits. Non-limiting examples of suitable polymers or 
polymers segments include oligonucleotides, oligoribonucleotides, peptide nucleic acids, 
nucleic acid analogs, nucleic acid mimics or chimems. 

A peptide nucleic acid (PNA) is a non-naturally occurring polyamide which can 
25 hybridize to nucleic acid (DNA and RNA) with sequence specificity (U.S. Pat No. 

5,539,082 and Eghohn et al., Nature 365: 566-568 (1993)). Being a ndn-naturally occurring 
molecule, uimiodified PNA is not known to be a substrate for the enzymes which are 
known to degrade peptides or nucleic acids. Therefore, PNA should be stable in biological 
samples, as well as have a long shelf-life. The hybridization of a PNA with a nucleic acid is 
30 fairly iadependent of ionic strength and is favored at low ionic strength, conditions which 
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strongly disfavor Ihe hybridization of nucleic acid to nucleic acid (Egholm et al, Nature, at 
p. 567). 

The terms "polynucleotide," "oligonucleotide," "nucleic acid" and "nucleic acid 
molecule", and "DNA" are used herein to include nucleotides of any length, either 
5 ribonucleotides or deoxyribonucleotides. This term refers only to the primary structure of 
the molecule. Thus, the term includes triple-, double- and single-stranded DNA, as well as 
triple-, double- and single-stranded RNA. It also includes modifications, such as by 
raethylation and/or by capping, and unmodified forms of the polynucleotide. More 
particularly, the terms "polynucleotide," "oUgonucleotide," "nucleic acid" and "nucleic acid 

10 molecule" mclude polydeoxyribonucleotides (containmg 2-deoxy-D-ribose), 

polyribonucleotides (containing D-ribose), any other type of polynucleotide which is an N- 
or C-glycoside of a purine or pyrimidine base, and other polymers containing 
nonnucleotidic backbones, for example, polyamide (e.g., peptide nucleic acids (PNAs)) and 
polymorpholino (conunercially available from tihe Anti-Virals, Inc., CorvaUis, Oreg., as 

15 Neugene) polymers, and other synthetic sequence-specific nucleic acid polymers providing 
that the polymers contain nucleobases in a configuration which allows for base pairing and 
base stacking, such as is found in DNA and RNA* There is no intended distinction in length 
between the terms "polynucleotide," "ohgonucleotide," "nucleic acid" and "nucleic acid 
molecule," and these terms will be used interchangeably. These terms refer only to the 

20 primary structure of the molecule. Thus, these terras include, for example, 3'-deoxy-2',5*- 
DNA, oligodeoxyribonucleotide N3T5* phosphoramidates, 2*-0-alkyl-substituted RNA, 
double- and single-stranded DNA, as well as double- and single-stranded RNA, DNArRNA 
hybrids, and hybrids between PNAs and DNA or RNA, and also include known types of 
modifications, for example, labels which are known in die art, metiiylation, "caps," 

25 substitution of one or more of flie naturally occurring nucleotides with an. analog, 

intemucleotide modifications such as, for example, those with uncharged linkages (e.g., 
methyl phosphonates, phosphotriesters, phosphoramidates, carbamates, etc.), with 
negatively charged linkages (e.g., phosphorofliioates, phosphorodithioates, etc.), and with 
positively charged linkages (e.g., aminoalklyphosphoramidates, 

30 aminoalkylphosphotriesters), those containing pendant moieties, such as, for example, 

proteins (including nucleases, toxins, antibodies, signal peptides, poly-L-lysine, etc.)> those 
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with intercalators (e.g., acridine, psoralen, etc.), those containing chelators (e.g., metals, 
radioactive metals, boron, oxidative metals, etc.), ttiose containing alkylators, those with 
modified linkages (e.g., alpha anomeric nucleic acids, etc.), as well as unmodified forms of 
the polynucleotide or oligonucleotide. In particular, DNA is deoxyribonucleic acid. 
5 The term "probe" refers to a biosequence which targets sequences in for example 

genes, DNA and DNA fragments. The probing nucleotide or nucleobase sequence of a 
probe may be the specific sequence recognition portion of the construct 

As used herein, the term 'target sequence" is a nucleic acid nucleobase sequence to 
which a least a portion of the probing nucleobase sequence is designed to hybridize. 

1 0 Atomic Force Microscope 

A schematic diagram of an atomic force microscope and image display is depicted 
in Fig. 1 A. The position of the cantilever/tip assembly is monitored during scanning with an 
optical deflection system. Fig. IB shows a scale model of a 5 ran radius of curvature silicon 
tip and a cross sectional view. The diameter of duplex DNA would correspond to the 

1 5 roughly circular black space surrounded by the GroES macromolecule and substrate (light 
rectangle). 

To exploit the intrinsic sensitivity and simplicity of AFM for DNA sequence 
detection, molecular scale tips are required fliat can be produced reproducibly. Carbon 
nanotubes consist of a honeycomb sp2 hybridized carbon network (termed a graphene 

20 sheet) that is rolled up into a seamless cylinder (Fig. 2a-c), which can be microns in length. 
There are two basic stractural classes of carbon nanotubes: single-walled nanotubes 
(SWNTs) and multi-walled nanotubes (MWNTs). SWNTs consist of a single seamless 
cylinder (Fig. 2a) with radii ranging from 0.35 to 2.5 nm, while MWNTs consist of multiple 
concentric graphene cylinders with radii ranging from 3-50 nm. Transmission electron 

25 microscopy (TEM) images of an individual MWNT (Fig. 2b) and SWNT (Fig. 2c) show 
clearly the tubular stracture of the nanotubes, and in the case of the SWNT (Fig. 2c) 
highlight the extremely small diameter and high aspect ratio, which makes this material 
xmiquely suited as a high-resolution probe tip. 

Nanotubes have exceptional mechanical properties. The characteristics relevant to 

30 the use of nanotubes as AFM tips are the stifBiess or Young's modulus and the ability to 
buckle elastically under large loads. Experimental measurements of Young's moduli made 
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by recording thermal vibration amplitudes in a TEM and by direct bending with AFM 
revealed Young's moduli of 1 .8 TPa and 1 .3 TPa, respectively, showmg that nanotubes are 
stiffer flian any other known material. The extremely high Young's modulus of nanotubes is 
desirable for the creation of high aspect ratio, sub-nanometer radius tips with high 
5 resolution — ^if the modulus was significantly smaller, then the amplitude of thermal 

vibrations would degrade the resolution of tips. Carbon nanotubes buckle elastically under 
large loads, unlike conventional materials that either facture or plastically deform. MWNTs 
and SWNTs have been observed in the buckled state by TEM, and appear to buckle in a 
maimer strikingly similar to macroscopic tubes made of elastic materials. The first direct 

1 0 experimental evidence that carbon nanotube buckling is elastic came from an early use of 
nanotubes as AFM tips. The buckling force of the nanotube tip could be measured fi:om the 
deflection of the AFM cantilever as the nanotube tip-surface separation was reduced. The 
nanotube would then return to its original configuration when the tip was removed from the 
surface. A more direct measurement of elastic nanotube buckling was achieved by AFM 

15 measurements of force versus displacement of nanotubes pinned at one end on a surface. 
Both types of experiments demonstrate that nanotubes can be bent close to 90-degrees 
maay tunes without observable damage, and thus are highly robust probes for AFM 
imaging. 

Carbon nanotube tips may be prepared by a direct synthesis of tips on standard 
20 atomic force microscopy cantilever/tip assemblies using metal-catalyzed CVD. In fliis 

process (Figs, 3a, b), a metal particle catalytically decomposes a hydrocarbon feedstock and 
nucleates the growth of a carbon nanotube. This feature of flie CVD process used for 
preparing nanotube probes provides conditions for the reproducible growth of SWNT tips 
having a consistent size and resolution and hence predictable characteristics, 
25 Tips synthesized using the electrochemically deposited iron catalyst may consist 

reproducibly of individual 3-5 nm radii MWNTs oriented optimally for high-resolution 
imaging. Significantly, these studies demonstrated that a well-defined synthetic approach 
could be used to prepare directly nanotube probes, thus opening flie possibility of precise 
control over nanotube size and thereby tip resolution. Recently, SWNT tips having much 
30 smaller radii of only 1-3 nm were reproducibly grown using well-defined iron oxide 
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nanocluster catalysts These latter tips begin to approach the theoretical minimum size 
expected for individual SWNTs. 

One process for producing CVD SWNT tips involves direct growth of SWNTs tips 
from the pyramid of a silicon cantilever-tip assembly shown in Figure 4, In this ^surface 
5 gro\yth' approach, the trade-off between the energy gain of the nanotube-surface interaction 
and the energy required to bend the nanotubes facilitates the reproducible growth of 
SWNTs from the silicon pyramid apex in the ideal orientation for high resolution imaging. 
This result can be understood by considering the fate of a nanotube during growth. When a 
growing nanotube reaches an edge of the pyramid, it can either bend to align with the edge 

1 0 or protrude from the surface. The pathway followed by the nanotube is determined by a 
trade-off in the energetic terms introduced above: if the energy required to bend the tube 
and follow the edge is less than the attractive nanotube-surface energy, then the nanotube 
will follow the pyramid edge to the apex; that is, nanotubes are steered towards tiie tip apex 
by flie pyramid edges. At the apex, the nanotube must protrude along the tip axis since the 

1 5 energetic cost of bending is too high. 

This steering of nanotubes to the pyramid apex to form ideal probe tips has been 
demonstrated experimentally (Fig. 5). For example, SEM investigations of nanotube tips 
produced by the surface growth method show that a high yield of tips contain nanotubes 
only at the apex, with very few protruding elsewhere from flie pyramid (Fig. 5). These tips, 

20 which have been produced under conditions expected to yield SWNTs, are sufficiently 
small so that SEM cannot provide an accurate measure of their diameters. TEM analysis 
was able to demonstrate that the tips consist of individual SWNTs and small SWNT 
bundles. In the case of the small SWNT bundles, the TEM images show that the bundles are 
formed by nanotubes coming together from different edges of the pyramid to join at the 

25 apex, thus confirming the surface growth model described above (Fig. 5c), 

The surface-growth approach for preparing mdividual SWNT tips appears to 
.advantageously lower the catalyst density on the surface such that only one nanotube 
reaches the apex. This approach may be readily extended to prepare multiprobe arrays. The 
reproducible growth of individual SWNT tips, which represent a unique and heretofore 

30 unavailable technology for molecular level detection and imaging, advantageously allows 
the detection and visualization of polymorphic sites on individual DNA molecules with 
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single nucleotide sensitivity, in particular the high-resolution detection of specific DNA 
sequences in kilobase firagments and haplotyping of a putative cancer related gene with two 
polymorphic sites, as will be described in more detail below. 

Well-defined iron nanoclusters may be used as a catalyst for CVD growth of 
5 nanotubes. The motivation for the use of preformed iron nanoclusters is two-fold. Catalysts 
with a well-defined diameter can control the diameter of nanotubes and nanowires and can 
lead to more reproducible initiation of active nanotube growth. The catalyst clusters can be 
made by solution phase synthesis, as a means of achieving greater control over growth. 
Another significant factor m defining the reproducibility of SWNT tip growth is dispersion 

10 of the catalyst on a micro-fabricated silicon pyramid, which serves as the support during 
surface growth. In general, the catalyst can be dispersed uniformly over the entire pyramid. 
This approach, however, can cause large fluctuations in the distribution of catalyst near the 
apex at the low catalyst concentrations used to achieve individual S WNTs, which can 
reduce reproducibility. To mitigate fluctuation and also enable a more efficient deposition 

1 5 of catalyst, a dual-purpose, semi-automated manipulation stage can be used for catalyst 
deposition on Si pyramids and DNA sample deposition on substrates. In this approach, the 
catalyst can be deposited in a defined outer region of the silicon pyramid through controlled 
transfer of a catalyst 'inked' elastomeric polydimethylsiloxane (PDMS) stamp. This can be 
achieved by depositing a thin, uniform layer of iron nanocluster catalyst onto a flat PDMS 

20 block by spin coating, and then bringing the pyramid into contact with the PDMS using the 
manipulation stage. Two characteristic features of this approach are that (1) it can enable 
confiinement of the catalyst to the outer 100 nm or less of the pyramid and (2) it can enable 
controlled batch processing of tip arrays, which is advantageous for large scale SWNT 
synthesis and for the creation of multiprobe arrays. 

25 The individual SWNT tips require relatively tight constraints on tip length to avoid 

loss of resolution due to thermal vibrations. It has been found that the amplitude of the tip 
vibration as a fimction nanotube length for several discrete dianieters for a radius of an 
individual SWNT tip of 0.5 nm, the length will need to be on the order of 10-20 nm to 
avoid loss of resolution. In other words, the nanotube lengfli has to be precisely controlled 

30 to take advantage of the potential resolution enhancements of individual SWNT tips. One 
method for controlling the length is an electrical etching method — by determining the 
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10 


15 


voltage dependence-which may aUow control of the tip shortening process on a nanometer 
scale. In one embodiment, pulsed electrical etching can be used to remove portions of the 
nanotube, wherein the amount removed by pulsed electrical etching depends directly on 
pulse height, with the length being reproducibly reduced in steps of ca. 2 nm or larger. The 
current shortening process hence provides sufficient control to optimize flie length of 
individual SWNT tips for imaging. 

In an embodiment, the SWNT tip length is controUed by direct growth; that is, 
defining the tip length by the growth time. This approach requires control of initial 
nucleation, growth rate and positioning of the catalyst, which aU can impact the 
reproducibiUty of the growth of individual SWNT. In one approach, nanotubes can be 
grown on planar substrates which enables rapid analysis of the resulting nanotube for a 
piedetermined set of conditions. 

It will be understood by those skilled m the art that the sample throughput can be 
increased at the image acquisition stage using parallel versus serial detection, for example, 
by employing multiprobe tip arrays for imaging. In addition, various modifications of the 
AFM can be implemented, such as high resonant frequency (>100 kHz) cantilevers in 
combination wilh fast system electronics to increase the potential image acquisition rate. 

An aspect of the present invention is the preparation and deposition of DNA 
samples or biosequences for simultaneous characterization and haplotyping of two or more 
20 distinct single nucleotide variations and detection of SNPs in large DNA samples. Since the 
AFM tip performs a controlled movement across the DNA molecule, elongation and 
alignment of DNA is desirable that are compatible with SWNT tip imaging. In one 
embodiment, reproducible sequence detection in large samples is attained by elongating the 
DNA to eluninate strand crossings, which could produce false positive signals. 
25 Within one embodiment, the fluid flow at the edge of evaporating drops is exploited 

to elongate DNA. This approach produces a radial alignment of the molecules on surfaces. 
This approach may also be used for SWNT AFM imaging using DNA samples in the 10- 
100 kb length range. In addition, the manipulation device, which was described above, may 
also be used for producing small periodic arrays of elongated samples, which can automate 
30 sample preparation. 
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rjhe evaporating droplet approach to DNA elongation cannot readily predict 
alignment of DNA relative to the AFM scan axes. Preferential alignment of multiple DNA 
samples with respect to one of the scan axes, however, can be advantageous for increasing 
throughput; that is, image data could acquired much more quickly for a high aspect ratio 
5 rectangle (n x m pixels with n » m) window versus a square (n x n) window. Another 
known approach for achieving both DNA elongation and preferential alignment involves 
the use of microfluidic stmctures. This approach utilizes a controlled laminar flow within a 
micron scale PDMS channel to elongate (flow) and align (unidirectional flow direction) the 
DNA in a defined maimer. 

10 Method A 

In one embodiment, specific hybridization of labeled oligonucleotide probes may be 
used to target sequences in DNA firagments, followed by direct reading of the presence and 
spatial locations of the labels by AFM.(Figure 6) The oligonucleotide probes are designed 
such that under appropriate hybridization conditions, binding does not occur in- the presence 

15 of a single-base mismatch at polymorphic sites; i.e., labels are detected only at sequences 
fiiUy complementary to the oligonucleotides. SWNT tips having tip radii of less than 3 ran 
were used, providing about 10 base resolution, for high resolution, multiplex detection of 
different labels. The exemplary oligonucleotides were labeled with eitiier streptavidin or the 
fluorophore IRD800, which can be consistently distinguished from one another on the basis 

20 of size. 

Fig. 6a shows labeled oligonucleotide probes ( -a* and -b*) that are specifically 
annealed to their complementary target sequences (a and b), but not to sequences with a 
single-base mismatch (A and B) in the single-stranded DNA template. DNA polymerase 
and dNTPs are then used to synthesize complementary strands, generating double-stranded 

25 DNA firagments specifically labeled at a and b with and , respectively. Fig. fib shows 
labeled DNA molecules deposited on freshly cleaved mica and imaged by AFM using 
SWNT probes. The presence and locations of the sequence specific tags ( and ) can be 
readily observed in the AFM image. It is apparent that due to the size of the tip, only the 
SWNT tip (left), and not the conventional Si tip (right), can be expected to provide 

30 sufficient spatial resolution. 
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Direct haplotyping using SWNT AFM probes appears to represent a significant 
advance over conventional approaches and could facilitate the use of SNPs for association 
and linkage studies of inherited diseases and genetic risk. These methodologies may be 
used to detect multiple SNPs in about 10 kilobase samples with a resolution of 
5 approximately 10 bases, and moreover, could be extendable to about 100 kilobase samples 
with similar resolution. The large DNA sizes that can be directly haplotyped are unique to 
the SWNT AFM technique and may be useful independent of the sample throughput. In 
addition, the simplicity and distinctiveness of the AFM images of alternative haplotypes 
indicate that automated analysis may also be feasible. The current throughput for an 

10 instrument imaging with a single SWNT tip may be greater than 200 samples/day with a 
redundancy of aboutlO independent images per sample. In addition, this approach could be 
extended from single sample analysis to a very high throughput parallel technique by 
exploiting multiple tip arrays, which have been fabricated in sizes as large as about 32x32 
for ultrahigh density hard-disk storage. The implementation of these technical 

1 5 improvements would allow haplotyping of over about 200,000 samples per day with a 
single instrument usmg our technique. Recent synthesis of carbon nanotubes with about 
0.25 nm radii, which are smaller ttian the spacing between DNA bases, indicates that 
further improvements in nanotube probes, nanotube end labeling and/or DNA labeling 
methods could enable direct reading of the DNA sequence of fragments that are tens of 

20 kilobases in size 
Method B 

In another embodiment, the gene sample comprises peptide nucleic acids with 
double strand DNA. This embodiment is illustrated in Fig. 9a which describes a PCR-free 
PNA labeling scheme, the subsequent DNA elongation when necessary, and flie label 
25 detection using an individual SWNT scanning probe. PNAs have shown great potential for 
use as probes to study single base mismatches in DNA as PNAs bind to dsDNA with high 
affinity, stability, and single base specificity. PNA is a DNA mimic in which the 
negatively-charged sugar backbone of DNA is replaced with an imcharged pseudo-peptide 
backbone. 

30 PNA/DNA hybrids have higher stability than DNA/DNA duplexes due to the lack 

of the electrostatic repulsion between the two strands. The probing nucleobase sequence of 
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a PNA probe is the specific sequence recognition portion of tbe construct Therefore, the 
probmg nucleobase sequence is a sequence of PNA subunits designed to hybridize to a 
target sequence to thereby be used to detect the presence, absence or number of organisms 
of interest in a sample. Consequently, with due consideration of the requirements of a PNA 
5 probe for the assay format chosen and the organism sought to be detected, the length of the 
probing nucleobase sequence of the PNA probe will generally be chosen such that a stable 
complex is formed with the target sequence under suitable hybridization conditions or 
suitable in-situ hybridization conditions. 

Those of ordinary skill in the art of nucleic acid hybridization wiU recognize that 

10 factors commonly used to impose or control stringency of hybridization include formamide 
concentration (or other chemical denaturant reagent), salt concentration (i.e., ionic 
strength), hybridization temperature, detergent concentration, pH and the presence or 
absence of chaotropes. Optimal stringency for a probe/target combination is often found by 
the well known technique of fixing several of the aforementioned stringency factors and 

1 5 then determining the effect of varying a single stringency factor. The same stringency 
factors can be modulated to thereby control the stringency of hybridization of a PNA to a 
nucleic acid, except that the hybridization of a PNA is fairly independent of ionic strength. 
Optimal stringency for an assay may be experimentally detennined by examination of each 
stringency factor until the desired degree of discrimination is achieved. 

20 In one embodiment, the duect interaction of peptide nucleic acid (PNA) labels with 

double stranded DNA (dsDNA) in haplotyping which can be readily detected by simple 
visualization of labeled DNA using atomic force microscopy (AFM) with an individual 
SWNT tip. Fig. 9B clearly demonstrates the super sharp geometry of an individual SWNT 
tip which is fabricated on a commercial tip apex using the 'pick-up' method. Individual 
25 SWNT tips typically show diameters of less than the DNA widfli, thus minimizing tip 
broadening effects. 

Detection of these PNA labels was accomplished by the use of individual SWNT 
scanning probes. SWNT probes provide routine high resolution imaging with flie average 
tip diameter and aspect ratio of 2 nm and 10:1, respectively.Using these SWNT tips the 
30 DNA resolution may be full width at half maximum (FWHM) of ~4 nm in width under 
typical ambient imagiug conditions. PNA labels exhibit maximum height difference of 


wo 02/022889 PCT/USOl/42138 


-20- 

only about 0.4 nm when compared to dsDNA whereas streptavidin labels conjugated to the 
biotinylated end of the PNA probes typically showed a change in hei^t greater than 1.5 nm 
(Fig.lOB). Therefore, the successful use of PNAs in haplotypmg, when accompanied by 
the utilization of individual SWNT probes, represents a significant advance towards a high- 
5 throughput technique for genomic screening. This ability to readily detect PNA labels 

enables a PCR-free, single-step labellmg reaction which avoids lengthy PGR procedures for 
label conjugation to denatured DNA and signal amplification wifli streptavidin. It also 
expands the size range of label choices by allowing detection of multiplex labels with 
various sizes where the lower detection limit is approximately few tenths of nanometers m 
10 height In one embodiment, fliis PCR-free labeling scheme employs PNA probes down to 
sizes as small as a 7-mer and incorporating an individual SWNT scanning probe to readily 
detect the PNA labels without the need of further conjugation with larger labels; 

Difficulties in SNP mapping of long DNA with an AFM include bending, kinking, 
coiling or crossing of micron-long DNA strands when deposited on the mica surface, which 
1 5 can hamper accurate label detection and effective image analysis. The alignment of DNA 
strands of tens of microns in length is required to overcome fliis issue in our direct unaging 
approach. In one embodiment, long DNA fragments, up to -15 microns in length, are 
dfrectionally aligned using controlled flow within micron-scale polydimethylsiloxane 
(PDMS) channel which can effectively elongate DNA along the flow dhection, as 
20 illustrated in Fig. 12A. Figs.l2B and 12C illustrate elongated ml3mpl8 and lambda DNA 
on a silicon/silicon oxide surface. This microfluidic ahgnment of DNA exhibit several 
advantages over the previous stretching techniques. The elongation of DNA is readily 
achieved without the use of centrifugal forces or electric fields and the alignment direction 
is well-defined. The directionality of aligned DNA can cooperatively enhance the 
25 efficiency of AFM detection. Fig.4D shows labelled mlSmplS DNA fragments which 
were reacted with PNA and then further conjugated with streptavidin. Periodically aligned 
DNA deposition as well as fabrication of multi-tip arrays may promote the assembly of an 
integrated system with automated DNA sample deposition, image acquisition, and image 
analysis capabilities for effective and accurate haplotype determination. 


30 
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Method C 

In yet another embodiment, SNP detection and haplotyping may be achieved on 
genomic DNA (gDNA) without amplifying the gene region. In one embodiment, single 
nucleotide polymorphism (SNP) haplotypes are determined in the cystic fibrosis 
5 transmembrane receptor gene, CFTR. The CFTR protein product is an ATP-binding 
cassette (ABC) transporter, fimctions as a chloride channel, and controls the regulation of 
other transport pathways. Homozygous mutations in this gene cause the respiratory disease 
cystic fibrosis In one embodiment, a biotinylated PNA tag is used to label specifically the 
genomic DNA pieces of CFTR. These maybe then optionally further conjugated with a 

10 moiety. In one embodunent, fliis moiety is streptavidin.. In another embodiment, further 
PNA labels may be employed to detect SNP sites. In a furflier embodiment, the moiety may 
be attached to super paramagnetic particles or labels, which may be used for separation 
purposes.(Figures 14 and 15) 

In another embodiment, the genomic DNA has more than one single nucleotide 

1 5 polymorphism. In a further embodiment, the single nucleotide polymorphisms further 
comprise probes. In an even further embodiment, the probes comprise different peptide 
nucleic acids. (Figure 19) 

Magnetic labels which may be used are beads or particles made either from 
nanometer-sized iron oxide crystallites, polymer impregnated with nanometer-sized iron 

20 oxide crystallites, or porous glass filled with iron oxide crystallites. These particles can be 
obtained with surface functional groups that may be used to immobilize molecules such as 
streptavidin, antibodies, or DNA. The particles are paramagnetic; that is, their 
magnetization is a function of the external magnetic field, and when the field is removed, 
the magnetization of the particles settles to zero. This "relaxation" does not happen 

25 instantly, but occurs over a period typically measured in microseconds or milliseconds, 

depending on the size of the iron oxide crystallites. Particles based on nanometer-sized iron 
oxide crystallites are sometimes termed "superparamagnetic", since their magnetization in a 
given magnetic field tends to be much greater than normal paramagnetic materials. 

Particles fabricated from ferromagnetic materials (such as NdFeB or nickel) or 

30 ferrimagnetic materials (such as micron-sized iron oxide or feirite particles) can also be 
used as magnetic labels. Both types of materials can be magnetized to a substantially 
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dissolved in water. 

Example 3 DNA sequence labeling in MlSmplS. 
Method A 

Biotin (Operon, Alameda, CA) or IRD800 (Li-Cor, Omaha, NE) labeled 
5 CGCGCCC (8 pmol) was annealed to 80 finol M13mpl8 in IX EcoPoI buffer (New 

England Biolabs, Beverly, MA) with 100 jiM dNTPs at 25*^0. Klenow Fragment exo" (New 
England Biolabs, 10 U) was added, allowed to sit at 25°C for 5 min, and then warmed to 
37°C for 30 min. Restriction digestion was carried out on -30 finol DNA with 5 U of BglR, 
BarnHl or Alwifl in the recommended digestion buffer (New England Biolabs) at 3TC for 

10 60 min. Digesting two separate aliquots with a different restriction enzyme is necessary 
because DNA fragment ends are indistinguishable by AFM. After digestion biotin labels 
were conjugated with streptavidin (7.5 pmol) at room temperature for 10 min in restriction 
buffer. Samples were ethanol precipitated and resuspended in 10 wM Tris, 1 mM EDTA 
(TE), pH 8.0, Multiplex labeling at the sequence GCTGAGA was performed the same as 

1 5 described above, except the annealing was carried out at 1 5°C. 
Example 4 Labeling UGTl A7 alleles. 
Method A 

PGR amplicons (--100 fmol) were denatured at 95''C for 10 min and then 
oligonucleotides (4 pmol) complementary to the 129N131R (IRD800-AATGACCGA) and 

20 208R (biotin-AGTACGGAA) loci were annealed at 24°C in IX EcoPol Buffer with 50 jiM 
dNTPs. Klenow Fragment exo" (2.5 U) was added, and the mixture was maintained at 24 °C 
for 2 min, followed by heating to 37*^0 for 30 min to extend the primed strands. Samples 
were purified with Concert PGR purification systems (Gibco BRL-Life Technologies, 
Grand Island, NY) to remove excess primers, and the DNA was resuspended in TE, pH 7.0. 

25 Streptavidin (0.5 pmol) was conjugated to biotinylated DNA in TE with 0.1 M NaCl for 2 
hr at room temperature. 

Example 5 Sample deposition and AFM imaging. 
Method A 

DNA was diluted to -100 pg/^il in 10 mM MgCl2 and deposited onto freshly 
30 cleaved mica for 5 min. Then the surface was rinsed several times with water and dried 
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greater magnetic moment than superparamagnetic particles, but fliey also retain their 
magnetism in the absence of an external magnetic field. 

After the reaction is completed, genomic DNA or DNA fi:agments may be separated 
from other pieces using a magnet since the molecular tag in the reaction was hybridized 
5 with super paramagnetic particles. SNP and haplotype detection was then perfonned using 
SWNT scanning probes directly on the separated and also aligned genomic DNA 
fragments. 
Exemplification 

The invention now being generally described, it will be more readily understood by 
10 reference to the following examples, which are included merely for purposes of illustration 
of certain aspects and embodiments of flie present invention, and are not intended to limit 
the invention. 

Example 1 UGT1A7 alleles . 

The alleles of UGT1A7 with theh Genbank accession numbers and genotypes at the 

15 polymorphic loci are UGT1A7*1, HSU39570 and HSU89507 (129N131R208W); 

UGT1A7*2, AF110191 (129K131K208W); UGT1A7*3, AF110192 (129K131K208R); 
UGT1A7*4, AFl 10193 (129N131R208R). The numbers and capital letters in the genotypes 
(i.e. 208R) correspond to the number and type of ammo acid in the protein encoded by the 
UGTl A7 gene at the polymorphic site. 

20 Example 2 PGR amplification of UGTl A7 samples. 

PGR primers (forward 5*-CTATCTGTACTTCTTCCACTTAC and reverse 5'- 
ACTTAC ATATCAACAAGAGCTGC) were designed to amplify a fragment which 
encompasses both polymorphic sites in the UGTl A7 first exon (from nt -76 to 1048 in tiie 
sequence corresponding to Genbank accession number HSU39570). PGR was perfonned on 

25 20 ng of genomic DNA in 50 ^1 aliquots containing 20 pmol of each primer, IX reaction 
buffer (50 mM KCl, 1.5 mM MgCl2, and 10 mM Tris pH 8.5), 100 dNTPs, 4% DMSO, 
and 2 U Tag DNA polymerase (PE Applied Biosystems, Branchburg, NJ). The 
amplification conditions were: denaturation at 94°G for 5 min, 5 cycles each consisting of 
60 s at 94°C, 45 s at 62^C, 90 s at 72°C followed by 30 cycles each consisting of 60 s at 

30 94*^C, 45 s at 56°C, 90 s at 72°C, followed by 7 min at 72**C. PGR products were purified 
using Qiagen quick columns (Qiagen, Santa Clarita, CA) to remove the primers and then 
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gently under a stream of nitrogen gas prior to AFM imaging. Images were recorded imder 
ambient conditions in tapping-mode at 1.5-2 Hz with a tip resonance frequency of 60-70 
kHz and amplitudes of 1 5-40 nm using a Digital Instruments (Santa Barbara, CA) 
Multimode Nanoscope Ilia. In contrast to previous contact-mode studies with micro- 
5 fabricated tips, flie relative humidity was found to have a minimal influence on measured 
DNA heights in our experiments, and thus no efforts were made to control ambient 
humidity. The insensitivity to humidity is due to the very small cylindrical structure of the 
nanotube probes, SWNT ropes were mounted on Au-coated force modulation etched silicon 
probe cantilevers (Digital Instruments, k=l-5 N/m) using micromanipulators under the 
10 direct view of an optical microscope as described previously. 
Example 6 Multiplexed sequence detection in M13mpl8. 
Method A 

The method according to the invention of detetcting haplotypes using SWNTs was 
employed to identify the spatial location of specific sequences with excellent discrimination 

15 from corresponding single-base mismatches in the M13mpl8 plasmid using seven base 
oligonucleotide probes. The essence of this experiment is captured in the AFM image of a 
DNA molecule that was marked with a streptavidin-labeled GGGCGCG sequence, as 
shown m Fig. 7A. This image shows a DNA fragment witih a 2200 nm contour length 
consistent with the 7249 bp of M13mpl8, and a distinct streptavidin label 1080 nm from 

20 one end of the BgM digested DNA. The arrow points to the streptavidin tag. Places where 
DNA strands cross each other (left side of height profile) are easily differentiated from 
labels. The image height scale is 3 nm and the white bar corresponds to 100 vsm. 

Figs. 7A shows a histogram summary of results obtained from at least 15 
streptavidin-labeled M13mpl8 DNA molecules showing a clear peak at 0.48 (3512 bp) 

25 from the BglO. restriction site for samples cut with BglH. Fig. 7B shows a similar histogram 
summary for samples cut with BgR of results obtained from the at least 15 streptavidin- 
labeled M13mpl8 DNA molecules showing a clear peak at 0.40 (2893 bp) from the Bamm 
restriction site. Fig. 7D shows a map of M13mpl8 with the location of GGGCGCG 
calculated from the histograms of Figs. 7B and 7C. Arrowheads indicate possible positions 

30 of the target sequence, based on flie calculated distance from the restriction sites; the 
labeled sequence occurs where two arrowheads meet (one from each digest). Solid arcs 
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indicate the correct paths, while incorrect paths are shown as dashed arcs with an "x" 
through them. 

In contrast, histograms &om control experiments with unlabeled oligonucleotides 
(data not shown) did not exhibit clusters of labels, indicating that the histogram peaks are 
5 due to specific detection of streptavidin. These results demonstrate two key points. First, 
based on the calculated distances of streptavidin labels from the two restriction site 
positions, the GGGCGCG site was determined to be at base 3390 (Fig. 7D). This same site 
was calculated to be at base 3402 firom similar experiments with IRD800 labeling (data not 
shown); both results are in excellent agreement with the known location (base 3405). 

10 Second, there is no evidence for labeling the single-base mismatch sites located at 11 1 5 and 
3595, thus demonstrating the specificity of labeling and the potential for SNP detection. 

Because the streptavidin and IRD800 molecules can be readily distinguished on the 
basis of their heights and sh^es (e.g. average measxjred height of streptavidin labels was 
1 .7 ± 0.5 imi vs. 0.7 ± 0.3 nm for IRD800) using the SWNT tips, simultaneous detection of 

15 two or more distinct sites is feasible. To test this concept, M13mpl 8 labeled at GGGCGCG 
with IRD800 and at TCTCAGC with streptavidin was prepared, and these fiagments were 
imaged using SWNT probes. Histograms similar to Fig. 7B, C were generated for 
streptavidin (>1 nm) and IRD800 (<1 nm) peaks detected in surface plots along imaged 
DNA Augments for BglR and AlwNl digests (data not shown). From these histograms, 

20 TCTCAGC was calculated to occur at bases 2024 and 4059, in good agreement with its 
known positions at 2013 and 4077, wifli GGGCGCG calculated to occur at base 3422, 
corresponding well with the expected value of 3405. These results demonstrate clearly the 
potential for multiplexed sequence detection in large DNA strands and open the possibility 
for profiling multiple polymorphic sites on DNA fi:agments in the 10 kilobase or larger size 

25 range. 

Example 7 Direct haplotvpe determination in UGTl A7. 
Method A 

Referring now to Figs. 8A-C, experiments were performed to identify specific haplotypes 
associated with genetic disorders. To illustrate this critical point, haplotypes were 
30 determined on a UDP-glucuronosyltransferase gene, UGTl A7, whose enzyme product is 
involved in inactivation of carcinogens such as benzo[a]pyrene metabolites, by using 
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SWNT probes. This gene has two polymorphic sites (separated by 233 bp) that determine 
four alleles, each specifying different polypeptide chains. Fig. 8 A shows schematically 
haplotypes, alleles, genotypes, and locations of probes in the analyzed samples. Individuals 
who are heterozygous at both sites have a single genotype. The (*l/*3) and (*2/*4) 
5 haplotypes, which have the same genotype, are specifically labeled at the 129N13 IR and 
208R sites with IRD800 (small filled circle) and streptavidin (large filled circle), 
respectively. These haplotypes, (*l/*3) or (*2/*4), cannot be differentiated usmg 
conventional methods. This ambiguity is crucial in screening, since each allele exhibits 
substantially different enzymatic activity towards targeted carcinogens. To distinguish these 

10 haplotypes using SWNT tips, allele specific probes were hybridized to DNA samples (Fig. 
8A). AFM images of a (*l/*3) sample should have an approximately equal number of 
' fragments that are -210 nm (663 bp) long with IRD800 at one end, or -140 nm (430 bp) 
long with streptavidin at one end (the random coil structure of the single-stranded DNA tail 
should not contribute significantly to the length). In contrast, a (*2/*4) sample should 

1 5 contain -2 1 0 nm fragments with IRD800 at one end and streptavidin -70 nm (233 bp) 

distant. The probes were chosen such that the (*l/*3) haplotype would show singly labeled 
fragments while the (*2/*4) haplotype would exhibit DNA with two or no labels. 

The subject samples could be unambiguously shown to exhibit the (*l/*3) 
haplotype by direct inspection of AFM images, Fig, 8B shows a representative SWNT tip 

20 AFM height image of the *3 allele (streptavidin end-labeled, -140 nm DNA) detected in a 
sample that was heterozygous at both loci. Fig. 8C shows a representative SWNT tip AFM 
height unage of the *1 allele (IRD800 end-labeled, -210 nm DNA) detected in the same 
sample. DNA molecules were only end-labeled with the streptavidin or the IRD800 probes. 
These images are characteristic of the *3 and *1 alleles, respectively. Control experiments 

25 on samples homozygous for the *2 allele showed no specific labeling (data not shown); 
because of the low occurrence frequency of the *4 allele, we did not test any samples 
known to carry tiiis allele. 

To fiirther substantiate ftiat flie haplotype determined by image inspection was 
indeed the consequence of specific probe hybridization to the expected sites on the target 

30 DNA, detailed length measurements were carried out. The histograms depicted in Figs. 8D, 
E plot the number of streptavidin and IRD800 end-labeled DNA fragments as a function of 
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length, for two different samples, each showing a grouping for streptavidin around 140 nm 
and for IRD800 near 210 nm. The cluster of streptavidin tagged DNA at -140 nm is typical 
of the *3 allele, and the grouping of IRD800 labeled fragments at -210 nm indicates the *1 
allele. These fragment distributions for both samples are in agreement with that expected 
5 for the (♦l/*3) haplotype (Fig. 8 A). Furthermore, no observed fragments matched the 
predicted profile of the *4 allele (Fig. 3 A); hence, both of these samples were of the 
haplotype (*l/*3). Fig. 8E shows a similar histogram as Fig. 8D, except that the histogram 
of Fig. 8E is from a different individual; however, both samples were determined to have 
the (*l/*3) haplotype. 
10 Example 8 PNA labelling in UGT1A7 and M13mpl8. 
Method B 

PNA labels for UGTl A7, biotm-5'-TTCCGTACTA-3' and biotin-S'-TTTTTT-Lys- 
Lys-TTTTTTGTCC-3' were acquired fxom PB Applied Biosystems (Branchburg, NJ). 0.1 
mol of PNA was reacted to -I nmol of UGTl A7 with 0.05M NaCl at 47°C for 120 min. 

15 Further conjugation with streptavidin (10 pmol) was carried out, when necessary, in room 
temperature for 30 min. after ethanol precipitation of the reaction products. The same 
purification process was repeated to the final product and the sample was resuspended in 10 
mM Tris, 1 mM EDTA (TE), pH 8.0. PNA labels wiflx sequences of biotin-5'-ACGCGCC- 
3' and biotin-5'-TCTCAGCC-3' were purchased from PE Applied Biosystems to target 

20 ml3mpl8. Linearization of ml3nq)18 was carried out on ^30 finol DNA with 5 U of Xbal 
in the reconmiehded digestion buffer (New England Biolabs) at 37°C for 60 min. After 
digestion, the reaction mixture was pxirified usmg Qiagen quick columns and the purified 
DNA was redissolved in TE. PNA labels (0. 1 mol) were added with 0.05M NaCl and the 
mixture was incubated at 40°C for 150 min. Samples were then ethanol precipitated and 

25 resuspended in TE, pH 8.0. PNA labels were conjugated with streptavidin (1 pmol) at room 
temperature for 30 min. Ethanol precipitation and resuspension in TE, pH 8.0 were 
repeated. 

Example 9 DNA alignment using PDMS microchannels. 
Method B 

30 A silicon master containing channel (100 m m width and 3 mm in length) was produced 
using standard photolithography procedures. PDMS elastomer channels were produced 
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from the master using Sylgard 1 84 cured at 70*'C for 1 80 min, PDMS elastomers seal 
conformally when fliey are brought into contact with poly-L-lysine (Ted Pella, Redding, 
CA) treated silicon oxide/silicon substrates. 2 1 of DNA in TE solution 1 pmol) was 
mixed with 5 1 of pure ethanol (Sigma Aldrich, Saint Louis, MO) immediately before the 
5 flow alignment The typical flow rate and duration were 1 mm/s and 5 min. respectively. 
After the removal of the PDMS, the sample was rinsed with deionized water and dried with 
nitrogen gas. 

Example 10 Sample deposition and AFM ima^ng . 
Method B 

10 DNA was deposited onto freshly cleaved mica for 5 min, with 10 mM MgCl2. Then the 
surface was rinsed several times with deionized water and dried gently under a stream of 
nitrogen gas prior to AFM imaging. Aligned DNA samples prepared using silicon 
substrates were treated in the same way for the rinsmg and drying. Images were recorded at 
1 .5-2 Hz with a tip resonance frequency of 60-70 kHz and an amplitude of 10-20 nm using 

1 5 a Digital Instruments (Santa Barbara, CA) Multimode Nanoscope nia. AFM tips with 
mdividual SWNT were prepared by the 'pick-up* method using FESP Si cantilevers 
(Digital Instruments, k==l-5 N/m). 
Example 1 1 
Methods 

20 PNAs are used forhaplotyping the UDP-glucuronosyltransferase gene, UGTl A7. The 
UGTl A7 gene ,the subject of extensive investigation due to its role in protection against 
benzo(a)pyrene- and aromatic amine-induced cancer, has two polymorphic sites which 
determine four alleles and specific combination of these alleles, haplotypes, are associated 
with potential risk factors which are caused by enviromnental carcinogens. Unambiguous 

25 determination of haplotypes of this gene, i.e., identification of alleles occurring for each 
copy of chromosomes, is therefore crucial to delineate the precise haplotypes associated 
with genetic disorders. AFM images in Fig. 1 OA clearly display PNA labels on the 
UGTl A7 gene which were consistently detected with SWNT tips. 
Custom designed PNA probes were used to tag polymorphic sites of this gene, as shown in 

30 schematic representations in Fig.9C. PNA probes were designed to delineate specifically 
the *l/*3 haplotype from the *2/*4 haplotype which were found in individuals who are 
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heterozygous at the two polymorphic sites having a single genotype, but have one of two 
haplotypes. Differentiation of tiiese two haplotypes is particularly important since it 
cannot be readily determined by conventional methods. Two PNA probes, specifically 
aiming at 208R and 129K131K sites respectively, target DNA sequences that are fully 
5 complementary to the probe sequences under a proper reaction condition. Differentiation 
of the two haplotypes can be achieved by a direct visualization with AFM as well as by 
examining the presence and spatial locations of these PNA labels. For example, the *l/*3 
haplotype samples will show both labels on the same DNA fiagment whereas the *2/*4 
haplotype will exhibit a single label on each DNA fragment. Therefore, immediate 
10 distinction of the two haplotypes based on the simple visualization of the gene is possible 
with our detection method; DNA samples shown in the bottom AFM images of Fig. 1 OB 
have, for example, the *l/*3 haplotype. 

Statistical analysis on the spatial locations of PNA labels on DNA strands were performed 
to substantiate accurate haplotype determination. A sample shown in the upper histogram 

1 5 in Fig.9B displays two peaks of PNA labels which are separated by the average distance of 
81.3 nm (- 239 base pairs). Then: average positions were measured as 30.7 nm and 1 12.0 
nm from one end of the gene with the average total length of 235.2 nna. The results show 
an excellent agreement with the actual locations of 129K131K (-97 bp) and 208R (--333 
bp) sites on the gene where these two polymorphic sites are separated by -236 base pairs. 

20 Therefore it is determined that this individual's haplotype is * l/*3. The lower histogram 
shows the result of the PNA label distribution on the gene taken from another individual, 
showing the presence of both labels at their expected locations, and thus indicating the 
same haplotype for this individual. Allele-specific PNA interaction was repeatedly 
monitored in samples from other individuals as well (data not shown); 1) the UGTl A7 gene 

25 of -1 123 bp in size was analyzed where only a single peak was resolved in the PNA label 
histogram. The position of the label, 206.7 nm from one end, confirms the existence of the 
129K1 3 1 K site but the absence of the 208R site, thus exhibiting the characteristic of a 
haplotype consisting of only *2 allele. 2) The PNA label distribution showed no clusters of 
labels when another DNA specimen was analyzed, indicating the *1/*1 haplotype. 

30 Example 12 
Method B 


V 
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A model system, ml3mpl 8 and PNA probes with single base specificity were used to target 
specific sequences corresponding to single or multiple sites on linearized ml3mpl 8 whose 
average length was determined as 2.35 mby AFM, Two PNA probes were employed to 
target 5'-ACGCGCC-3' and 5'-TCTCAGCC-3' sites respectively. Single base mismatch 
5 sites for these sequences are present at multiple positions on ml 3mpl 8. However, these 
PNA probes reacted only with the DNA sequences that were completely complementary to 
the probe sequence under appropriate reaction conditions and binding does not occur in the 
presence of a single-base mismatch at polymorphic sites, demonstrating the exceptional 
sequence-selectiveness of PNA probes in SNP mapping of DNA up to --7 kilobases in 

10 length. The histogram in Fig.ll A shows that the two labels are located at the average 

distance of 0.28 m and 1 .42 m from the restriction site which nicely corresponds to the 
esthnated positions of the 5'-ACGCGCC-3' (0.26 m) and 5'-TCTCAGCC-3' (1.44 m) 
sites fix)m the restriction site. Results shown in Fig. 11 B further confirm the PNA efiBciency 
in SNP mapping with its excellent discrimination from the existing single base mismatch 

15 sites. 

Example 13 PGR amphfication of CFTR alleles. 
Method C 

PGR primers (forward 5'-AGGGCATAGCTCTGTGGCATAAAG and reverse 5'- 
GAACTGTCTCTCAGGGGTATTTCTCC) were designed to amplify a 4265bp fragment 

20 containing part of introns 9 and 10, and the entire sequence for exon 10 (from nucleotide 
10109 to 14374 in the sequence corresponding to GenBank accession number ACOOOl 11). 
PGR was performed on 50ng of genomic DNA in 50ul aliquots containing 20pmol of each 
primer, IX reaction buffer (40mM Tri-HCL pH 9.3, 15mM KOAc, and 0.2% Triton X- 
100), 1.1 mM Mg(0AC)2, lOOuM dNTPs, and IX Advantage Genomic DNA or IX 

25 Advantage cDNA polymerase (Clontech, Palo Alto, CA). The amplification conditions 
were as follows: denaturation at 94**C for 5 min, 40 cycles each of 15s at 94*^0, 6 mm at 
68**C, followed by 10 min at 72*'C. PGR products were gel purified on a 0.7% agarose gel 
followed by purification using Qiagen QIAquick columns (Qiagen, Santa Clarita, CA) to 
remove primers and then dissolved m ImM Tris-HCl pH 8.5. 

30 Example 14 PNA labelling in Amplified CFTR. 
Method C 


1 
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PNA labels for CFTR as shown below in Table 1 and Figure 16 were acquired from PE 
Applied Biosystems (Branchburg, NJ). 0. 1 mol of PNA was reacted to ~1 0 nmol of 
CFTR with 0.1 M NaCl at 47°C for 180 min, Furflier conjugation with streptavidin (10 
pmol) was carried out, when necessary, in room temperature for 30 min. after ethanol 
5 precipitation of the reaction products. The same purification process was repeated to the 
final product and the sample was resuspended in 10 mM Tris, 1 mM EDTA (TE), pH 8.0. 


PNA Reference Marker 

Biotin-5 '-CTTTATGCCACAGAGCTA-3 ' 

PNA label for Site 1 

Biotm-5'-CTTGCCTCCA-3' 
FIuorescein-5 ' -CTTGCCTCC A-3 ' 

PNA label for Site 2: 
Delta F508 

Biotin-5 '-ATCATCTTTG-3 ' 
Fluorescein -5'-ATCATCTTTG-3' 

PNA label for Site 3 

Biotin-5'-GTCCGCTAG-3' 
Fluorescein-5'-GTCCGCTAG -3' 


Example 15 PNA labelling in genomic PNA. 
10 Method C 

Genomic DNA was obtained from Coriell Institute for Medical Research (Camdon, 
NJ) and diluted in 10 mM Tris, 1 mM EDTA (TE), pH 8.0 to a concentration of -100 nmol. 
Two methods were used to randomly reduce sizes of gDNA fragments; 1) The solution was 
spun at 14000 ipm for 20 min prior to PNA labeling and 2) Digestion of gDNA was carried 

1 5 out witti 2U otBgin in the recommended digestion buffer (New England Biolabs) at 2iTC 
for 60 min. After digestion, the reaction product was purified using Qiagen quick columns 
and the purified gDNA was redissolved in TE. 0. 1 mol of PNA was then reacted to 
gDNA with 0. 1 M NaCl at ATC for 1 80 min. Streptavidm coated superparamagnetic 
particles were purchased from Promega Corporation (Milwakee, WI) and washed in pure 

20 ethanol three times using a magnetic stand (Promega Corporation, Milwakee, WI). PNA 
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labeled gDNA was reacted with streptavidin coated superparamagnetic particles for 30 min 
at room temperature. After conjugation, the gDNA reaction mixture was kept overnight at 
4 **C on a magnetic stand. Magnetically captured genomic DNA fragments were separated 
and aligned on a freshly cleaved mica substrate by spiiming gDNA with MgCU solution at 
5 8000 rpm for 30 sec. The sample was then rinsed with deionized water and blow-dried 
under a gentie stream of N2 gas. AFM images were recorded at 1 .5-2 Hz wi± a tip 
resonance frequency of 60-70 kHz and an amplitude of 10-20 nm using a Digital 
Instruments (Santa Barbara, CA) Multimode Nanoscope Ilia. AFM tips with individual 
SWNT were prepared by the *pick-up' method using FESP Si cantilevers (Digital 
1 0 Instruments, k=l -5 N/m). 
Example 16 
Method C 

Genomic DNA is used without amplifying the gene region of CFTR. A biotinylated 
PNA tag is used to label specifically the genomic DNA pieces of CFTR which were then 

15 further conjugated with streptavidin. At the same time, other PNA labels were used to 
detect SNP sites, for example delta F508 site. (Figure 18) The streptavidin in this case was 
attached to super paramagnetic particles and it reacts only with the biotinylated PNA tag 
which is specially designed such that it only binds with the CFTR gene. After the reaction 
was completed, genomic DNA fragments of CFTR was separated from other pieces using a 

20 magnet since the streptavidin in the reaction was hybridized with super paramagentic 

particles, SNP and haplotype detection was then performed using SWNT scanning probes 
directly on the separated and also aligned genomic DNA fragments. 
Example 17 Mulitplexing 
Method C 

25 Both biotin- and fluorescein- terminated PNA labels were used to target specific 

SNP sites of CFTR gene. The biotinylated PNA label was subsequently conjugated with 
streptavidin. This reaction resulted in multiple SNP sites which can be easily differentiated 
by AFM on the basis of different label sizes. (Figure 19) 
Incorporation by Reference 

30 All of the patents and publications cited herein are hereby incorporated by 

reference. 
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Equivalents 

Those skilled in the art will recognize, or be able to ascertain using no more than 
routine experimentation, many equivalents to the specific embodiments of the invention 
described herein. Such equivalents are intended to be encompassed by the following claims. 
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We claim: 

1 . A method of detecting single nucleotide polymorphisms of gene samples using an 
atomic force microscope (AFM), comprising the steps of: 

providing the AFM with at least one nanotube tip, 
5 moving said nanotube tip across the gene sample, and 

recording an image obtained with the AFM. 

2. The method of claim 1 , wherein the nanotube tip comprises a single walled 
nanotube. 

3. The method of claim 2, wherein the nanotube tip comprises multiple single walled 
10 nanotubes. 

4. The method of claim 2, wherein the nanotube tip comprises an individual single 
walled nanotube. 

5. The method of claim 1, wherein the gene sample is a DNA fragment 

6. The method of claim 5, wherein the gene sample is a DNA fragment containing 
1 5 between 1 0 and 1 0,000 bases. 

7. The method of claim 5, wherein the DNA fragment comprises an amplified DNA 
fragment using a polymerase chain reaction. 

8. The method of claim 6, wherein the DNA fragment ftirther comprises a probe. 

9. The method of claim 6, wherem flie DNA fragment ftirther comprises an 
20 oligonucleotide probes designed to bind at sequences complementary to the 

oligonucleotides. 

10. The method of claim 9, wherein said oligonucleotide probes ftuiher comprise a label 
which can be detected by atomic force microscopy. 

1 1 . The method of claim 10, wherein said label is streptavidin. 

25 12. The method of claim 8, wherein the probe conq)rises a peptide nucleic acid. 

13. The method of claim 12, wherein the peptide nucleic acid probe further comprises a 
label which can be detected by atonric force microscopy. 

14. The method of claim 1, whereiu the gene is genomic DNA. 

15. The method of claim 14, wherein the genomic DNA further comprises a probe, 
30 16. The method of claim 1 5, wherein the probe comprises a peptide nucleic acid. 
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17. The method of claim 1 6, wherein the genomic DNA further comprises a label which 
can be detected by atomic force microscopy, 

18. The method of claim 14, wherein the genomic DNA comprises more than one 
probe. 

5 19. The method of claim 1 8, wherein said probes are different 

20. The method of claim 1 8, wherein said probes are the same. 

21. The method of claim 1, wherein said gene has more flian one single nucleotide 
polymorphism. 

22. The method of claim 21, wherein said smgle nucleotide polymorphisms further 
10 comprise probes, 

23. The method of claim 22, wherein said probes comprise different peptide nucleic 
acids. 

24. The method of claim 1 7, wherein said label comprises a paramagnetic label. 

25. The method of claim 14, wherein the genomic DNA comprises a cystic fibrosis 
1 5 transmembrane receptor gene. 

26. The method of claim 25, wherein the cystic fibrosis transmembrane receptor gene 
further comprises a peptide nucleic acid probe with the sequence given in SEQ ID 1 . 

27. The method of claim 1 , wherein the nanotube tip has a radius of less than 3 nm. 

28. The method of claim 1, wherein the AFM comprises arrays of single walled 
20 nanotube tips. 

29. The method of claim 1, wherein the gene sample comprises an array of gene 
samples. 

30. An apparatus for detecting single nucleotide polymorphisms of gene, comprising: 

an atomic force microscope (AFM) with a scanning probe having at least 
25 one nanotube tip, 

scaiming means for scanning said nanotube tip across a sample, 
detection means for detecting with said nanotube tip a characteristic feature 
of at least a portion of the sample, said characteristic feature corresponding to a 
height and/or a spatial extent of said at least one sample portion, 
30 wherein said sample is a gene sample. 
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31. The apparatus of claim 30, wherein the nanotube tip is comprised of a single walled 
nanotube. 

32. The apparatus of claim 31, wherein the nanotube tip is comprised of multiple single 
walled nanotubes. 

5 . 33. The apparatus of claim 31, wherein the nanotube tip is comprised of an individual 
walled nanotube. 

34. The apparatus of claim 30, wherein the gene samples further comprise a probe. 

35. The apparatus of claim 34, wherein the gene samples comprise oligonucleotide 
probes. 

10 36. The apparatus of claim 34, wherein the gene samples comprise peptide nucleic 
probes. 

37. The apparatus of claim 34, wherein the gene samples are genomic DNA. 

38. The apparatus of claim 30, wherein the nanotube tip has a radius of less than 3 nm. 

39. The apparatus of claim 30, further comprising a micron scale channel. 

15 40. The apparatus of claim 29, wherein the AFM comprises arrays of single walled 
nanotube tips. 


wo 02/022889 


PCT/USOl/42138 




SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


2/23 



FIG. 1B 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 PCT/USOl/42138 

.3/23 



FIG. 2B FIG. 2C 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


4/23 


0 


CH2CH2 


650 - 850 °C 


CARBON NANATUBE 


FIG. 3A 


Ar y- MFC 


M 


M 

F 


F 

C 


C 






TIPS 


FURNACE 



EXHAUST 



FIG. 3B 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 PCT/USOl/42138 


5/23 


DEPOSIT 
CATALYST 



FIG. 4 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


6/23 



6) 


wo 02/022889 


PCT/USOl/42138 


7/23 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


.8/23 



LENGTH 

FIG. 7A 


Bg/W 


BamH\ 



FIG. 7D 


8H 


Q 

515 4H 


— I — 
0.1 


0.2 



0.3 


0.4 


I 


0.5 


FRACTIONAL DISTANCE FROM RESTRICTION SITE 


FIG. 7B 



0.1 0.2 0.3 0.4 
FRACTIONAL DISTANCE FROM RESTRICTION SITE 


FIG. 7C 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCTAJSOl/42138 


9/23 


(*1/*3) 


208W 
129N131R 


208R 
M29K131K 

*3 


('2^4) 


ii 208W 
129K131K 


208R 
129N131R 


IRD800 O 


^ streptavidin 




FIG. 8A 




FIG. 8B FIG. 8C 

SUBSTITUTE SHEET (RULE 26) 


PCT/USOl/42138 


10/23 



wo 02/022889 PCT/USOl/42138 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


:1:2/23 


1 


CO 
I 

< 

o 


u. 


D 


CM 
O 


S2 


1 


CM 
I 

CD 
O 


/ 


E 
c 

o 


O 



o 

CM 


I 

CO 

o 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


13/23 


10-1 



30 60 90 120 150 180 210 240 
LENGTH (nm) 


FIG. 10C-1 


10-1 


[2 8-1 
z 

LU 

5 6-1 
O 


£ 4— 


< 

Q 2- 


T — r — r "c "I — I — I — I 

30 60 90 120 150 180 210 240 


LENGTH (nm) 

FIG. 10C-2 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


14/23 



I 

< 

CD 

LL 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


1^5/23 



QQ 


1 

O 


CD 


I 

CD 


"T 


CM 


.00 

"o 


Qb 

o 


CNJ 

o 


SlN3IAI9Vyd VNQ 


mmm. ■ .^•"/.^-v;.:.:-r^o'^■... 


CM 
I 

CO 


CD 


Oiu 

is 


CO 
I 

CO 



I 

CD 


LL 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


16/23 



FIG. 12C FIG. 12D 

SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USO 1/42138 


17/23 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 PCTAJSOl/42138 

18/23 


I- 
LU 
Z 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


PCT/USOl/42138 


19/23 




wo 02/022889 


PCT/USOl/42138 


20/23 


n 

CO 


(0 o 

» t; ^ 

o < < 

•O U Z 

!^ 5 


o 
o 
< 
o 
o 


0) 
X) 

3 

Q. 



O 
X 

E 
c 

o 
m 


Q 

CO 

g 

LL 


< 
CO 


o 
u> 

X 

E 
c 


o 

CO 

g 

LJ. 


g 

< 

% 

< 

o 
o 
o 


E 


CD 

I 


Q. 

ID 

9 


01 
CO 


: 


QQ 
CO 


SUBSTITUTE SHEET (RULE 26) 



SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 PCT/USOl/42138 


22/23 



FIG. 18 


SUBSTITUTE SHEET (RULE 26) 


wo 02/022889 


23/23 


PCTAJSOl/42138 



SUBSTITUTE SHEET (RULE 26) 


This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the appUcant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SffiES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 


